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ORIGINAL ARTICLE 

Copy-neutral loss of heterozygosity is prevalent and a late event 
in the pathogenesis of FLT3/YTD AML 

DL Stirewalt 1 , EL Pogosova-Agadjanyan 1 , K Tsuchiya 1,2,3 , J Joaquin 1 and S Mesh inchi 1,4,5,6 

Patients with high FLT3 internal tandem duplication allelic ratios (FZT3/ITD-ARs) have a poor prognosis. Single-nucleotide 
polymorphism/comparative genomic hybridization, single-cell PCR and colony-forming assays were used to evaluate genotypic 
evolution of high FZ. 73/ITD-ARs in 85 acute myeloid leukemia (AML) patients. Microarrays were used to examine molecular pathways 
disrupted in leukemic blasts with high FZ. 73/ITD-ARs. Copy-neutral loss of heterozygosity (CN-LOH) was identified at the FLT3 locus 
in diagnostic samples with high FLT3/\JD-ARs (A/=11), but not in samples with low FLT3/\JD-ARs (A/ = 24), F/T3-activating loop 
mutations (A/ = 11) or wild-type FLT3 (N = 39). Single-cell assays showed that homozygous FLT3/\JD genotype was present in 
subsets of leukemic blasts at diagnosis but became the dominant clone at relapse. Less differentiated CD34 + /CD33" progenitor 
colonies were heterozygous for FLT3/\JD, whereas more differentiated CD34 + /CD33 + progenitor colonies were homozygous 
for F/.73/ITD. Expression profiling revealed that samples harboring high F/.73/ITD-ARs aberrantly expressed genes within the 
recombination/DNA repair pathway. Thus, the development of CN-LOH at the FLT3 locus, which results in high FZ. 73/ITD-ARs, 
likely represents a late genomic event that occurs after the acquisition of the FLT3/ITD. Although the etiology underlying the 
development of CN-LOH remains to be clarified, the disruption in recombination/DNA repair pathway, which is present before 
the development of LOH, may have a role. 
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INTRODUCTION 

Activating mutations of the FLT3 gene occur as a result of an internal 
tandem duplication (FZT3/ITD) of the juxtamembrane domain-coding 
sequence or a point mutation within the activation loop domain 
(FZT3/ALM). 1 " 3 Although both mutations lead to the constitutive 
activation of the receptor, only presence of FLT3/WD has consistently 
been associated with clinical outcome. 4-13 We and others have 
demonstrated that allelic ratio (AR) of FLT3/WD has prognostic 
significance, such that those patients with a high FZT3/ITD-AR have 
an inferior outcome compared with those with a low FLT3/WD- 
AR. 8,12 " 14 The central mechanism underpinning the variation in AR 
and its prognostic significance remains poorly understood. 

Analyses of microsatellite markers have demonstrated that a 
small subset of patients with very high ARs actually harbor a loss 
of heterozygosity (LOH) of the FLT3 gene at chr13q locus. 9,15,16 
Recent reports suggest that copy-neutral LOH (CN-LOH), which is 
also known as acquired uniparental disomy, may have a role in the 
development of high F/.73/ITD-AR within leukemic blasts. 17,18 
However, additional studies are warranted to better characterize 
the frequency of CN-LOH in patients with FLT3/\JD and how this 
process may contribute to the development of high F/.73/ITD-AR. 
In this study, single-nucleotide polymorphism/comparative 
genomic hybridization (SNP/CGH) profiling was used to better 
define the underlying mechanism and temporal evolution of 
allelic variation in AML blasts containing FLT3 mutations. 

MATERIALS AND METHODS 

Patients and treatment 

Pediatric patients with acute myeloid leukemia (AML) enrolled in the 
Children's Cancer Group, AML clinical protocols CCG 2941 and 2961, were 



candidates for this study. Details of the aforementioned protocols are 
described elsewhere. 19,20 Evaluated patients were limited to those with 
>80% leukemic blasts in the bone marrow (BM) or peripheral blood 
sample. All samples were further enriched for leukemic blast by ficoll 
preparation. DNA was extracted by Qiagen (Valencia, CA, USA) DNA 
extraction kit per manufacturer instructions. 



Affymetrix mapping 250K SNP/CGH arrays and data processing 
Samples were genotyped with Affymetrix 250K Nsp GeneChip Human 
Mapping arrays (Affymetrix, Inc., Santa Clara, CA, USA), which interrogated 
over 262 000 loci and allowed the evaluation of genomic copy number and 
LOH. DNA was digested with restriction enzymes, amplified by PCR, 
purified, labeled, fragmented and hybridized to the arrays according to the 
manufacturer's instructions. Briefly, 250 ng of DNA was digested with Xba\, 
Hind\\\ or Sty\ (New England Biolabs, Boston, MA, USA). Digested DNA was 
adaptor-ligated and PCR-amplified using AmpliTaq Gold (Applied 
Biosystems, Foster City, CA, USA) in 1 00 liI PCR reactions for each 
enzyme-digested sample. PCR products from each reaction were pooled, 
concentrated and fragmented using DNase I (Invitrogen, Carlsbad, CA, 
USA). Fragmented PCR products were then labeled, denatured and 
hybridized to the arrays. Arrays were washed using Affymetrix fluidics 
stations, and scanned using the GeneChip Scanner 3000 (Affymetrix, Inc.). 
CEL files were generated using Affymetrix GeneChip Genotyping Analysis 
Software (GTYPE) v4.0. 21 

SNP calls were generated using GTYPE by incorporating the Copy 
Number Analysis Tool (CNAT) 4.0.1 package. 22 Samples (85) passed the 
recommended minimum dynamic model call rate of 93%. Log2 signal 
ratios were estimated using the unpaired-copy-number workflow with a 
Gaussian smoothing window of 100 000 bp. Copy-number states were 
estimated by a Hidden Markov Model using CNAT as per the 
recommended parameters. LOH was estimated using the unpaired-LOH 
workflow on genotype files produced by the BRLMM algorithm at default 
settings. The reference set was comprised of 51 normal samples from the 
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International HapMap Project. Results were visualized with the Partek 
Genomics Suite, v. 6.07.0801. The algorithms utilized by CNAT 4.0 have 
been previously described. 21 

Fluorescence-activated cell sorting for colony-forming cells 
Fluorescence-activated cell sorting purification for hematopoietic progeni- 
tor subpopulations for colony-forming cell assays on methylcellulose, as 
well as fluorescence-activated cell sorting-based single-cell sorts for PCR 
were performed using anti-CD34-fluorescein isothiocyanate and anti- 
CD33-phycoerythrin (Becton Dickinson, San Jose, CA, USA) with appro- 
priate isotype controls as previously described. 23 Propidium iodide (Sigma- 
Aldrich, St. Louis, MO, USA) was used to select viable cells. Desired 
populations were selected on a Vantage flow cytometer (Becton Dickinson) 
and collected in Iscove's modified Dulbecco's medium (Invitrogen). 
Colonies were grown in methylcellulose culture supplemented with 
granulocyte colony-stimulating factor, granulocyte-macrophage colony- 
stimulating factor, stem cell factor, interleukin-3, interleukin-6 and 
erythropoietin (all from PeproTech, Rocky Hill, NJ, USA) as previously 
described. 23 

Single-cell FLT3 genotyping 

Individual colonies were harvested from methylcellulose and sorted as 
individual cells into 20 (il of RNase-free water supplemented with 10 U 
RNaseOUT RNase Inhibitor and 1 mM dithiothreitol (Invitrogen) on Vantage 
flow cytometer. FLT3 genotyping was carried out using nested touch-down 
PCR. First-round amplification was performed using primers ITDA-F 
(5'-CCTGATTGTCTGTGGGGAGT)-3' and ITDB-R (5'-ACCCCAATCCACTCCAT 
TTT-3') in a final reaction volume of 50|il, consisting of 25|il 2 x Failsafe 
PCR Premix D (Epicentre, Madison, Wl, USA), 2.5 \x\ PCR-grade water, 2.5 U 
Platinum Taq (Invitrogen), 0.2 |im of each primer and 20|il single-cell 
template. Cycling conditions were as follows: initial denaturation, 94 °C for 
5min, followed by 15 cycles of 94 °C for 30 s denaturation, 70-56 °C 
(A - 1 7cycle) for 45 s annealing and 72 °C for 1 min extension; this was 
followed by another 25 cycles with the same parameters, but with 
annealing temperature of 56 °C and completed with a final extension of 
7 min at 72 °C. Second-round amplification was performed using primers 
CG13F-6FAM (5 / -[6FAM]TGCCTATTCCTAACTGACTCATCA-3 / ) and CG14R 
(S'-TCTTTGTTGCTGTCCTTCCA-S') and a 25-|il reaction mix consisting of 
1 2.5 |il 2 x Failsafe PCR Premix B (Epicentre), 6.25 |il PCR-grade water, 2.5 U 
Platinum Taq (Invitrogen), 0.2 |im of each primer and 5(il template DNA 
from the first-round amplification described above. Touch-down cycling 
conditions were as listed above, with the exception of the annealing and 
extension times were reduced to 30 s each. 

Fluorescence in situ hybridization (FISH) evaluation of colony- 
forming cell colonies 

FISH was performed using bacterial artificial chromosome (BAC) RP11- 
136G6 (13q12.2) that contains the FLT3 gene, and BAC RP11-408E5 
(13q12.11, near the centromere) as a control for chromosome 13 copy 
number. BAC DNA was isolated from a single-colony culture using the 
Large-Construct Kit according to the manufacturer's protocol (Qiagen). 
DNA was labeled by nick translation using SpectrumOrange dUTP 
(RP11-136G6) or SpectrumGreen dUTP (RP11-408E5) following the 
manufacturer protocol (Vysis Inc., Downers Grove, IL, USA). Labeled BAC 
DNA was precipitated, resuspended in LSI/WCP hybridization buffer (Vysis 
Inc.), and hybridized to slides according to the manufacturer's protocol. 
Labeled BAC DNA was initially hybridized to normal human metaphase 
cells to verify probe localization to the correct band on chromosome 13. 
At least two and up to 20 independent colonies from each cell population 
(30-323 total interphase nuclei per population) were scored. 

Microarray expression analyses 

Details of subjects included in the microarray studies are provided in 
Supplementary Table 1. RNA was extracted from BM and peripheral blood 
samples, and quality of RNA was analyzed on HP 2100 Bioanalyzer (Agilent 
Technologies, Palo Alto, CA, USA). 24 DNA from available samples was 
evaluated for NPM1, DNMT3A, IDH1 and IDH2 mutations as previously 
described 25-27 Five micrograms of total RNA underwent biotin-labeling 
using the Enzo BioArray HighYield RNA Transcript Labeling Kit (Enzo 
Life Technologies, Farmingdale, NY, USA) as per manufacturer's 
recommendations. The biotin-labeled product was fragmented as per 
the standard Affymetrix protocol 24 Fifteen micrograms of fragmented 



antisense cRNA was then hybridized to the HG-U133A arrays (Affymetrix, 
Inc.). DAT, CEL and CHP files were generated using GCOS 1.2.1 and MAS 5.0 
software (Affymetrix, Inc.) as previously described. 24 All microarray data 
met previously determined quality controls. 28 CEL files were normalized 
using robust multi-array average (gcRMA) 29,30 

Log2 expression values were analyzed using GenePlus software (Enodar 
Biologic, Seattle, WA, USA) 24,31 Two separate analyses were performed. 
The first analysis directly compared the expression profile between eight 
AML samples without FLT3/\JDs {FLT3/w\\d type (WT)) and five AML 
samples with high F/.73/ITD-AR (F/.73/ITD H -ar). 24 ' 31 Expression changes with 
a Z-score ^4.75 or ^-4.75 were deemed to be statistically 
significant. 32,33 The second analysis examined expression changes across 
the three different groups: normal CD34 cells from BM (BM34), FLT3/WX 
and F/.73/ITDh-ar. For this analysis, the three groups were assigned a group 
number (BM34 = 0, F/T3/WT= 1 and F/.r3/ITD H - AR = 2). A linear regression 
analysis examined the expression changes across the three groups. 24,31 
Expression changes with a Z-score ^4.75 or ^ -4.75 were deemed to be 
statistically significant. 32,33 Detailed clinical, cytogenetic and F/.73/ITD-AR 
information, along with the CEL files for each of the normal and AML 
samples, are available at the Gene Expression Omnibus 34 All normal BM34 
samples and a portion of the AML samples have previously been used for 
other microarray analyses 24,31 

Network generation and pathway analyses 

Affymetrix IDs and Z-scores for significant genes were analyzed using 
Ingenuity Pathways Analysis (IPA) software (Qiagen, Redwood City, CA, 
USA). Using the IPA software, we determined the significance of functional 
and canonical pathways as previously described 24 All relationships were 
supported by at least one literature reference and/or information from the 
Ingenuity Pathways Knowledge Base 35 Functions and canonical pathways 
with a P-value ^0.05 were considered to be statistically significant 24,36 
Networks were generated using genes identified from the microarray 
analyses and IPA Network algorithm. A Network Score was based on 
hypergeometric distribution and calculated using the right-tailed Fisher 
Exact Test, taking into account total number of network eligible genes in 
the Knowledge Base, the number of genes analyzed and network size: the 
higher the score, the less likely that the observed fit between the analyzed 
genes and the network was due to chance 24,36 



RESULTS 

SNP/CGH genomic profiling identifies CN-LOH at 13q locus 
Diagnostic specimens from 85 patients with and without FLT3/\JDs 
were analyzed by SNP/CGH arrays. To control for the potential 
impact of blast variability, all specimens had >80% blasts at 
diagnosis, and leukemic blast populations were further enriched 
before the study by ficoll processing. The 85 patients displayed the 
following genotypes: FLT3/\JD 35/85 (41%), F/T3/ALM 11/85 (13%) 
and FLT3/WT 39/85 (46%). Patients with FLT3/\JD had ARs (that is, 
ratio of ITD to WT product) ranging from 0.2 to 7.05. 

Using data from the Affymetrix Nsp arrays, chromosome 13 was 
evaluated for copy-number and LOH alterations, and analyses 
compared with samples with and without FLT3 mutations. 
Unclustered analysis of the entire cohort demonstrated large 
region of LOH in specimens from 12/85 patients (Figure 1a). 
Clustering analyses revealed that all samples with LOH 13q 
contained FLT3/\JDs (Figures 1b and c), and the ITD-AR was 
substantially higher in those FLT3/WD samples with LOH 13q 
(median = 2.5, range 1.21-8.88) as compared with FLT3AJD 
samples without LOH 13q (median = 0.65, range 0.20-0.88). Those 
with LOH 13q were further analyzed for copy-number alterations to 
determine whether observed LOH was due to large segmental 
deletion or CN-LOH. The observed LOH 13q was not due to 
deletion, thus indicating the presence of CN-LOH at 13q 
(CN-LOH 13q ) in 12/35 specimens with F/.73/ITD. Furthermore, the 
region of CN-LOH conversion always involved the FLT3 gene 
(Figure 1d), and the LOH extended from a locus that was 
centromeric to FLT3 and encompassed the entire distal part of 
13q. To define the transition point for the CN-LOH conversion, a 
detailed evaluation of the centromeric locus was performed. In 
11/12 cases with CN-LOH 13q , the CN-LOH transition occurred at 
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Figure 1. CN-LOH in FLT3/ITD samples with high allelic ratio, (a) Unclustered profile for chr13 for the entire cohort demonstrates segmental 
LOH (red color) in a subset of samples, (b) Clustering based on FLT3 genotype shows that a large segmental LOH segregates with FLT3/UD. (c) 
Clustering of FLT3/UD AML by LOH 13q/ segregates samples based on allelic ratio, (d) Simultaneous copy number (C#) and LOH alteration in five 
representative patients with LOH 13q are shown (left panel). C# state represents patients with or without any copy-number changes. CN state of 
1 represents no loss of copy number. Deletions are represented as a downward deflection (for example, patient 4, del9q, £) and any 
duplication or trisomy are represented with a upward deflection from baseline (for example, patients 4 and 5, trisomy 8, ^). L0H state is 
represented as either 0 (no LOH, baseline) or 1 (LOH, upward deflection, patients 1-5, chr. 13, ►). Inset is a close up of Chr13 for patient 
number 5, demonstrating the region of transition into LOH (right panel), (e) Figure shows BFU-E or CFU-GM colonies derived from 
CD34 + /CD33" myeloid progenitors and CFU-GM colonies derived from more differentiated CD34 + /CD33 + cells. The cells were subjected 
to FLT3 genotyping by PCR and LOH determination by STR analysis and FISH. Colonies derived from CD34 + /CD33~ progenitors 
were heterozygous for FLT3/YTD (gel), with no LOH detected by STR and no deletions by FISH (green signals — control probe near chromosome 
13 centromere, red signals — BAC probe that includes FLT3). Cultured colonies from more differentiated CD34 + /CD33 + progenitors were 
homozygous for FLT3/UD (gel), had LOH by STR, but showed no evidence of deletion by FISH, suggesting CN-LOH. 



1 3q1 2.1 , and in the remaining case, the entire 13q had undergone 
conversion. 



Single-cell analysis to detect subpopulations of leukemic blasts 
with potential CN-LOH 13q 

Given the intra-individual genomic heterogeneity of AML, we 
examined whether subpopulations of leukemic blasts may harbor 
CN-LOH 13q , which would be undetected by standard SNP/CGH 
analyses. Individual leukemic blasts from five patients with known 
FLT3ATD without previously identifiable CN-LOH 13q by SNP/CGH 
analyses (see Figure 1d) were sorted into PCR tubes, and single- 
cell PCR assays were utilized to define FLT3AJD status. FLT3 signal 



recovery for the single-cell assays ranged from 50 to 85% in each 
case. In each of the five diagnostic samples, three genotypic 
subpopulations were identified: (1) heterozygous FLT3/\JD, (2) 
homozygous FLT3/WD and (3) FLT3/WT (Table 1). The homozygous 
FLT3/\JD genotype, likely owing to CN-LOH 13q , was demonstrated 
in 13-34% cells of the five samples tested. These findings suggest 
that patients with low FZT3/ITD-AR at diagnosis may harbor 
CN-LOH 13q in minor subpopulations of leukemic blasts. Single-cell 
analysis of paired diagnostic and relapsed samples from two 
patients demonstrated that the homozygous FLT3/WD genotype 
became the dominant genotype at relapse (Table 1), indicating a 
clonal evolution to a more uniform population of leukemic blasts 
harboring CN-LOH 13q upon relapse. 
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Table 1. Distribution of molecular subtypes of FLT3 gene in a single-cell assay 



Diagnostic 



Relapse 





Heterozygous ITD 


Homozygous ITD 


FLT3/WT 


Heterozygous ITD 


Homozygous ITD 


FLT3/WT 


Pt 1 


27% 


21% 


62% 


10% 


83% 


7% 


Pt 2 


32% 


32% 


37% 


12% 


77% 


11% 


Pt 3 


3% 


13% 


83% 


ND 


ND 


ND 


Pt 4 


21% 


26% 


53% 


ND 


ND 


ND 


Pt 5 


16% 


34% 


50% 


ND 


ND 


ND 



Abbreviations: ITD, internal tandem duplication; ND, not determined; WT, wild type. 



Table 2. Top functions for genes with significant expression differences between FLT3/\NT and FLT3/\TD H . AR samples 



Function 


P-value 


Number of genes associated with function 


a Cancer 


4.50E-04-3.70E-02 


28 


a Cellular growth and proliferation 


8.66E-05-3.57E-02 


23 


hematological system development and function 
a lmmune response 


3.82E-05-3.67E-02 


16 


1.86E-04-3.67E-02 


15 


Immune and lymphocyte development and function 


1.86E-04-3.57E-02 


14 


Small molecule biochemistry 


9.31E-05-3.53E-02 


12 


hematologic disease 


4.61E-04-3.70E-02 


12 


Cellular development 


3.82E-05-3.81E-02 


11 


immunologic disease 


4.50E-04-3.33E-02 


11 


Neurological disease 
a Cell cycle 


1.09E-03-3.33E-02 


9 


1.31E-03-3.56E-02 


9 


Molecular transport 


9.31E-05-3.53E-02 


8 


Tissue development 


1.86E-04-3.33E-02 


7 


a Cell morphology 


4.61E-04-3.33E-02 


7 


Lipid metabolism 


9.31E-05-3.53E-02 


6 


Cellular compromise 


4.61E-04-3.33E-02 


5 


Genetic disorders 


1.09E-03-3.33E-02 


5 


Gene expression 


3.08E-04-2.78E-02 


4 


Nucleic acid metabolism 


1.09E-03-3.33E-02 


4 


Hepatic system disease 


1.66E-03-1.66E-03 


2 


Abbreviations: ITD, internal tandem duplication; WT, wild type. Represents an overlap of top functions between the two analyses evaluating (a) FLT3 /WT and 



FZ. 73/ITDh-ar and (b) BM34, FLT3ANJ and FZT3/ITD H -ar. 



CN-LOH 13q is a late event in AML pathogenesis 
In order to evaluate the sequence of events leading to CN-LOH 13q/ 
CD34+/CD33" and CD34 + /CD33 + progenitor cells from two 
patient samples with CN-LOH 13q (high FZT3/ITD-AR) were isolated 
and subjected to colony-forming cell culture. Erythroid burst- 
forming unit (BFU-E) and granulocyte, monocyte colony-forming 
unit (CFU-GM) colonies were sequentially evaluated by PCR for 
FLT3/\JD status, short tandem repeat analyses (STR) for LOH, and 
FISH for genetic loss. BFU-E and CFU-GM colonies grown from 
CD34 + /CD33~ leukemic blasts (less differentiated population) 
demonstrated heterozygous ITD and no LOH by STR, indicating 
that the less differentiated progenitors within the colony harbored 
FLT3/\JDs but had not yet developed LOH. As expected, the FISH 
results for this population uniformly demonstrated two copies of 
chromosome 13 and no deletion of FLT3. In contrast, CFU-GM 
colonies cultured from more differentiated CD34 + /CD33 + 
leukemic blasts uniformly demonstrated homozygous ITD and 
LOH 13q by STR evaluation but still showed no evidence of 
chromosome 13 loss or deletion of FLT3 by FISH (Figure 1e). This 
data demonstrates that FLT3ATD evolves initially in the progenitor 
population as a heterozygous clone (that is, no CN-LOH 13q ), and 
subsequent events likely lead to the evolution of CN-LOH 13q in the 
more differentiated population (Figure 1e). 

Gene expression changes associated with CN-LOH 13q 

To identify expression changes potentially associated with the 

development of homozygous ITD status and subsequent 



CN-LOH 13 , microarray studies were performed using leukemic 
samples from AML patients without FLT3/\TD (FLT3/\NT, N = 8) and 
those with very high FLT3ATD ARs (FLT3/\TD H _ ARf N = 5). To reduce 
the potential impact of normal hematopoietic cells and leukemic 
maturation stage on the expression profile, only AML samples 
with high blast counts (>80%) that expressed CD34 were selected 
for the microarray studies (Supplementary Table 1). A direct 
comparison of expression profiles between FLT3/WT and FLT3/ 
ITDh-ar identified 84 genes and 2 unique probe sets (that is, 
sequences without an annotated gene) with significant expression 
differences between the two sample groups (Supplementary 
Table 2). Using FLT3/WT samples as the baseline, 43 genes and 1 
unique probe set displayed increased expression in the FLT3/\JD H . AR 
samples, four of which were HOX family members, whereas 41 
genes and 1 unique probe set displayed a decreased expression 
(Supplementary Figure 1). The uniformity in expression of top 
candidate genes suggests that presence of other known muta- 
tions (for example, NPM1 and IDH1) may not be a major 
contributor to expression differences observed between FLT3I 
ITDh-ar and FLT3ANJ samples (Supplementary Table 1). 

IPA software was used to identify significant functions, networks 
and canonical pathways associated with the 84 identified genes. 
The top 20 functions associated with the 84 genes are listed in 
Table 2, and the top 12 networks are provided in Supplementary 
Table 3. The most significant functions were 1 cancer, 2 cellular 
growth and proliferation and 3 hematologic system function and 
development. Not surprisingly, the network with the highest score 
was comprised of genes associated with similar functions: cellular 
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proliferation, hematologic development and function, and 
immune response. The only canonical pathway associated with 
the 84 genes was the cell cycle pathway that regulates G2/M 
checkpoint for DNA damage (that is, cell cycle: G2/M DNA damage 
checkpoint regulation). 

AML samples can harbor CN-LOH at loci other than 13q. 17 ' 18 ' 37-40 
Thus, AML samples with CN-LOH at other loci may display the 
same or similar expression changes as those with CN-LOH 13q . 
To further examine potential expression changes associated with 
CN-LOH, we incorporated the expression profiles of normal CD34 
cells from the BM (BM34, N = 8) into our analyses, which are 
unlikely to harbor CN-LOH. 41 For these analyses, samples were 
divided into three groups based on the likelihood of harboring 
CN-LOH: normal BM34 (unlikely to harbor CN-LOH), FLT3/WT 
(unlikely to have CN-LOH 13 but potentially harboring CN-LOH at 
other regions) and FLT3/\JD H . AR (CN-LOH 13 likely in the majority of 
blasts and potential CN-LOH at other loci). The normal BM34 were 
designated as the 'control' and given the covariate of 0, whereas 
FLT3/\NJ and FLT3/\JD H . AR were designated as having covariates of 
1 and 2, respectively. Linear regression analysis identified 642 
genes and 3 unique probe sets with significant expression 
changes (Supplementary Table 4). The analysis identified genes 
with relatively heterogeneous expression changes within the 
FLT3/WT group, which became more homogenous in the FLT3/\JD H . AR 
samples (Figures 2a and b). The presence of other mutations did 
not appear to explain these mutation differences. Approximately 
55% of the genes/unique probe sets (A/ = 364) displayed a linear 
increase in expression with increasing certainty of CN-LOH 
presence, while 45% (N = 281) displayed decreasing expression 
with increasing certainty of CN-LOH presence (Figure 2c). Thirty- 
three of these genes were also identified in the previous analysis 
comparing AML patients with FLT3/WT vs FLT3/\JD H . AR . 

Similar to the first analysis, IPA software was used to identify 
significant functions, networks and canonical pathways associated 
with the 642 significant genes. The top 20 functions are provided 
in Table 3, and as expected, there was considerable overlap 
among the significant functions associated with 84 and 642 gene 
sets (Tables 2 and 3). The top five networks are provided in 
Supplementary Table 5. The network with the highest score was 
comprised of genes associated with cell cycle, cellular movement 
and cancer. When considering the top 20 networks 
(Supplementary Table 6), the three most frequently represented 
functions were (1) cancer, (2) cell cycle and (3) DNA replication, 
recombination and repair. As DNA replication, recombination and 
repair likely have a central role in the development of CN-LOH, we 
examined the genes associated with these functions and their 
relationship in more detail (Table 4). All the genes within this 



Figure 2. Linear expression changes associated with likelihood of 
CN-LOH. (a) Figure shows the log2 expression (y-axis) of stabilin 1 
(57/487, potential role in regulating lipid metabolism and angiogen- 
esis) across normal BM34 (A/ = 8), FLT3/WT {N = 8) and FLT3/\JD H . AR 
{N = 5) samples. As depicted in the figure, the expression 
significantly increases (Z= 17.09, P<0.01), yet there is considerable 
heterogeneity in its expression within the FLT3/\NT group, (b) Figure 
shows the log2 expression (y-axis) of serpin pepitidase inhibitor, 
clade B {SERPINE2, potential role in regulating apoptosis and 
differentiation) across normal BM34, FLT3/\NT and FLT3/\JD H . AR 
samples. As depicted in the figure, the expression significantly 
increases (Z= -20.90, P<0.01), and as with STAB1, there is 
heterogeneity in its expression within the FLT3/\NT group, 
(c) Heatmaps show increasing and decreasing expression (y-axis) 
of 364 and 281 genes from two analyses, respectively, across the 
three different subpopulations of cells (x-axis). Expression range is 
provided in the lower right corner. Expression for each gene within a 
group represents the average expression of the gene across all 
samples within the group. Overall, as the likelihood of CN-LOH 
increases, so does the uniformity of the expression and differences 
in the expression within the normal and malignant cell populations. 



functional category displayed decreasing expression with increas- 
ing likelihood of CN-LOH being present (that is, BM34 to FLT3/WT 
to FLB/ITDh-ar). 



DISCUSSION 

In this study, we demonstrate that not only CN-LOH 13q is detected 
in a subset of FLT3/WD patients with very high AR (AR > 1), but we 
show lack of CN-LOH 13q in patients without FLT3/\JD, suggesting 
a potential causal link between FLT3/WD and evolution of 
CN-LOH 13q . We also demonstrate that presence of CN-LOH 13q is 
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Table 3. Top functions for genes with significant linear expression changes in BM34, FLT3/WT and FLT3/ITD H -ar samples 



Function P-value Number of genes associated with function 



a 

Cancer 


1 .67E-08- 


-1 .36E-02 


196 


a Cellular growth and proliferation 


5.91 E-06- 


-1 .33E-02 


1 73 


Cell death 


1 .37E-06- 


1 Trir r\—\ 

-1 .39E-02 


158 


Hematologic disease 

1 1 1 

Cell cycle 


z.4Ut-UD- 


- 1 .oot-Uz 


I U4 


y.zob-07- 


-1 .41 E-02 


92 


a lmmunologic disease 


I .O I t-lo- 


- 1 .oDt-Uz 


y i 


Cellular movement 


8.03E-07- 


-1 .36E-02 


91 


hematological system development and function 


o.Uot-U/- 


- 1 .oDt-Uz 


OO 


Inflammatory disease 
a lmmune response 


3.51 E-07- 


-1 .41 E-02 


85 


8.03E-07- 


-1.41 E-02 


75 


DNA replication, recombination and repair 


2.79E-07- 


-1.29E-02 


73 


Gastrointestinal disease 


3.53E-06- 


-1.36E-02 


68 


Skeletal and muscular disorders 


4.24E-06- 


-1.36E-02 


60 


a Cell morphology 


4.81E-06- 


-1.36E-02 


58 


Connective tissue disorders 


9.86E-07- 


-3.79E-05 


53 


Reproductive system diseases 


1.70E-06- 


-1.03E-02 


46 


Cell-to-cell signaling and interaction 


1.35E-05- 


-1.36E-02 


39 


Cellular assembly and organization 


2.79E-07- 


-1.36E-02 


29 


Cell function and maintenance 


1.35E-05- 


-1.36E-02 


23 


Renal and urological disease 


4.24E-05- 


-1.36E-02 


18 



Abbreviations: ITD, internal tandem duplication; WT, wild type. Represents an overlap of top functions between the two analyses evaluating (a) FLB/WTand 
F/T3/ITD H -ar and (b) BM34, FLT3/WT and FZT3/ITD H -ar. 



Table 4. Downregulated genes within DNA replication, recombination and repair pathways associated with FLT3/ITD H -ar 



Process annotation P-value Genes/molecules Number of molecules 



Recombination of DNA 2.50E-03 CHEK1, DKFZP762E1312, HMGB1, KPNA2, MSH6, REC8, RUVBL2, XRCC5 8 

Checkpoint control 3.67E-03 BUB3, BUB1, CDC6, CDC20, CDK2, CHEK1, MCM7, NDC80 8 

Double-stranded DNA break repair 4.96E-03 FEN1, KPNA2, POM 7, SOD1, SOD2, XRCC5 6 

Abbreviation: ITD, internal tandem duplication. Process associated with the overall function category (first column), P-value for significance of the process 
(column 2), genes within the process (column 3, all with decreasing expression from BM34 to FLT3/WT to FLT3/ITDH-AR), and number of genes within the 
network (column 4) are provided. Overall, the data suggest that the expression of genes is downregulated in these functional pathways as the likelihood of 
harboring CN-LOH increases. 



not limited to those with very high ITD-AR, but in fact, as 
demonstrated by single-cell FLT3 genotyping, CN-LOH 13q can be 
present as a minor clone in FLT3/\JD cases with low AR, bringing 
into question what are the additional events that may lead to 
clonal dominance of this homozygous clone. The results from our 
studies indicate that the evolution of CN-LOH 13q likely represents 
a late event in the pathogenesis of high FLT3/\JD-ARs, such that 
FLT3/WD initially develops as a heterozygous mutation with the 
corresponding WT FLT3 allele. Over time, the heterozygous FLT3/ 
ITD blasts acquire CN-LOH 13q/ resulting in homozygous mutated 
alleles and a high FLT3ATD-AR. 

Previous genome-wide studies have identified CN-LOH at other 
loci in AML and in a variety of different hematopoietic 
malignancies. 1 7,1 8,37-40,42 These other loci also frequently contain 
oncogenes or tumor suppressors (for example, TP53, MPL, NRA5 
and so on). 39 As with FLT3/\TD and CN-LOH 13q , the development 
of CN-LOH at these loci also results in homozygous mutant alleles 
and the loss of expression of the WT protein. The relative 
expression of mutant vs WT proteins (that is, 'mutant dosage 
effect') likely has a critical role in malignant transformation and 
disease progression, 43 and a mutant dosage effect has been 
demonstrated for RAS mutations 44-50 Like FLT3/\TDs, RAS 
mutations enhance proliferation and likely provide a growth 
advantage 44,48 " 50 For example, hematopoietic progenitors with 
homozygous NRA5 mutations develop a phenotypically different 
hematopoietic malignancy than similar progenitors harboring 
heterozygous NRAS mutations. 50 In the case of FLT3/\TDs, 
CN-LOH 13q also likely provides a growth and/or survival 



advantage through the establishment of a homozygous FLT3AJD 
state 43 Higher FZT3/ITD-AR (that is, likely to harbor CN-LOH 13q ) are 
associated with higher relapse rates (that is, more resistant 
disease), 14 and are found more frequently at relapse. 

Few studies have examined the temporal relationship between 
the development of mutations and CN-LOH. Wang et a/. 50,51 used 
a Cre-Lox murine model to introduce a single NRAS mutant allele 
into hematopoietic stem cells. Using this model, they showed that 
~40% of mice harboring a heterozygous NRAS mutation 
subsequently acquired CN-LOH at the NRAS locus, resulting in a 
homozygous NRAS mutant state. Recently, similar murine models 
have been used to examine the cooperative nature between 
FLT3/\JDs and NPM1 mutations 43 In these experiments, transgenic 
mice harboring heterozygous mutations for both FLT3/WD and 
NPM1 rapidly developed an AML-like disease. Interestingly, LOH of 
the WT FLT3 locus was observed in a very high percentage of the 
mice with the most rapid disease progression 43 Thus, the 
available data from NRAS and FLT3AJD murine models are in 
keeping with our findings, which indicate that FL73/ITD mutations 
in humans likely precede the development of LOH. 

Several theories have been proposed to explain the potential 
mechanism linking genomic mutations and CN-LOH 39,52 
Expanding upon previous mechanistic models, it can be 
hypothesized that some mutations, particularly those with 
extensive deletions or insertions, may destabilize the DNA 
structure in ways that directly promote CN-LOH during mitotic 
recombination or during failed attempts to correct the mutant 
allele. According to this model, CN-LOH develops in a 
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'deterministic' manner, occurring initially at the mutated locus. 
Alternatively, CN-LOH may develop in a more 'stochastic' manner. 
According to this model, CN-LOH randomly develops throughout 
the genome of leukemic blasts. Clones developing CN-LOH at loci 
containing advantageous mutations are perpetuated, while those 
clones developing CN-LOH in normal allelic regions will have no 
survival advantage and eventually perish. This stochastic model 
does not exclude the possibility that the mutations have a role in 
the initial development of CN-LOH. For example, certain 
oncogenic mutations may disrupt key regulatory pathways that 
control genomic fidelity, apoptosis and proliferation. In the case of 
FLT3/\JD mutations, the fact that CN-LOH 13q is not observed in 
FLT3/WT cases is highly suggestive of causal link between FLT3/\JD 
and evolution of CN-LOH 13q . In addition, the expression data from 
others suggest that many pathways are disrupted in leukemic 
blasts harboring ITD mutations. 53-55 In the current study, we 
identified a number of expression differences between FLT3/WT 
and FZ-B/ITDh-ar (Figure 2; Tables 2-4). Some of these expression 
changes involve the DNA repair/recombination pathway, which, if 
antedate the development of CN-LOH, may have a role in its 
development. Hence, the disruption of these critical functions may 
predispose cells to the development of CN-LOH throughout the 
genome, eventually leading to the emergence of a dominant 
clone with FLT3/\JD H . AR . However, the number of samples 
evaluated for expression studies were limited, and we cannot 
rule out that other potential confounders may be contributing to 
the observed expression changes. Additional studies are needed 
to better dissect out the molecular mechanisms underlying these 
expression differences and determine the potential etiology 
driving the development of CN-LOH in FLT3/\TD blasts. 
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